ImGCL: Revisiting Graph Contrastive Learning on Imbalanced Node Classification

نویسندگان

چکیده

Graph contrastive learning (GCL) has attracted a surge of attention due to its superior performance for node/graph representations without labels. However, in practice, the underlying class distribution unlabeled nodes given graph is usually imbalanced. This highly imbalanced inevitably deteriorates quality learned node GCL. Indeed, we empirically find that most state-of-the-art GCL methods cannot obtain discriminative and exhibit poor on classification. Motivated by this observation, propose principled framework Imbalanced classification (ImGCL), which automatically adaptively balances from Specifically, first introduce online clustering based progressively balanced sampling (PBS) method with theoretical rationale, training sets pseudo-labels obtained We then develop centrality PBS better preserve intrinsic structure graphs, upweighting important graph. Extensive experiments multiple datasets settings demonstrate effectiveness our proposed framework, significantly improves recent methods. Further experimental ablations analyses show ImGCL consistently representation under-represented (tail) classes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On Mining Fuzzy Classification Rules for Imbalanced Data

Fuzzy rule-based classification system (FRBCS) is a popular machine learning technique for classification purposes. One of the major issues when applying it on imbalanced data sets is its biased to the majority class, such that, it performs poorly in respect to the minority class. However many cases the minority classes are more important than the majority ones. In this paper, we have extended ...

متن کامل

Active Learning for Imbalanced Sentiment Classification

Active learning is a promising way for sentiment classification to reduce the annotation cost. In this paper, we focus on the imbalanced class distribution scenario for sentiment classification, wherein the number of positive samples is quite different from that of negative samples. This scenario posits new challenges to active learning. To address these challenges, we propose a novel active le...

متن کامل

Semi-Supervised Learning for Imbalanced Sentiment Classification

Various semi-supervised learning methods have been proposed recently to solve the long-standing shortage problem of manually labeled data in sentiment classification. However, most existing studies assume the balance between negative and positive samples in both the labeled and unlabeled data, which may not be true in reality. In this paper, we investigate a more common case of semi-supervised ...

متن کامل

On Contrastive Divergence Learning

Maximum-likelihood (ML) learning of Markov random fields is challenging because it requires estimates of averages that have an exponential number of terms. Markov chain Monte Carlo methods typically take a long time to converge on unbiased estimates, but Hinton (2002) showed that if the Markov chain is only run for a few steps, the learning can still work well and it approximately minimizes a d...

متن کامل

On Mining Fuzzy Classification Rules for Imbalanced Data

Fuzzy rule-based classification system (FRBCS) is a popular machine learning technique for classification purposes. One of the major issues when applying it on imbalanced data sets is its biased to the majority class, such that, it performs poorly in respect to the minority class. However many cases the minority classes are more important than the majority ones. In this paper, we have extended ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i9.26319